CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation-in-part application of co-pending application serial No.
09/360,872 filed on July 23, 1999.

FIELD OF THE INVENTION 

The present invention relates to space-time coding schemes and to channel estimation for
space-time encoded transmissions. The invention relates particularly, though not exclusively,
to space-time coding schemes for radio communications.

BACKGROUND TO THE INVENTION 

The evolution of high rate data services within future wireless networks will call for new RF 
access technologies to enable substantial increases in overall system spectral efficiency at 
an acceptably low cost to the user. This assumption is based on the widely promoted view 
that data capacity requirements will grow exponentially over the next few years with RF 
spectrum remaining a scarce commodity. 



Space-Time Coding (STC) is an antenna array processing technology currently stimulating
considerable interest across the wireless industry. It promises to deliver substantial system
benefits in terms of the achievable air interface spectral efficiency, and is considered for
future high capacity, wireless data products.

STC exploits both the temporal and spatial dimensions for the construction of coding designs
which effectively mitigate fading (for improved power efficiency) and are able to capitalise
upon parallel transmission paths within the propagation channel (for improved bandwidth
efficiency). The use of multiple propagation paths is fundamental to the concept, and
provides the means for better Signal to Noise Ratio (SNR) gain (i.e. power efficiency) and
better diversity performance of the radio link. Furthermore, in a rich multipath
environment antenna processing can provide access to independent, parallel transmission
paths, which are also referred to as 'spatial modes', 'spatial channels' or 'data pipes'.

In Figure 3 there is shown a plot for the throughput (as defined by Foschini and Gans, "On
Limits of Wireless Communications in a Fading Environment when using Multiple
Antennas", Wireless Personal Communications, Vol. 6, 1998, Kluwer Academic Publishers,
pp. 311-335) which is exceeded for 90% of frames (known as the '10% Outage Capacity')
from a number of transmit elements (N_T) and a number of receive elements (N_R) for the
cases of (N_T, N_R) = (1,1), (2,2), (4,4) and (1,4) over 10,000 random channel realisations for
different overall mean SNRs. In order to calculate fundamental per-link capacity bounds,
coded data frames of length approaching infinity are assumed, with random, independent,
Rayleigh channel realisations for each transmitter-receiver path. Thus an (N_R, N_T) matrix of
complex channel gains, H, can be constructed with independent complex Gaussian
elements. A transmitter would have no prior knowledge of the instantaneous channel
realisations, and so would send equal-power uncorrelated noise-like waveforms to the
different antennas. It can be shown that the fundamental capacity (i.e. maximum throughput
for vanishingly small error probability) for such a system would vary from frame-to-frame, as
a function of the eigenvalues of the matrix product HH^H.
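By way of illustration only, the outage capacity calculation described above can be sketched as a small Monte Carlo experiment. The sketch assumes the standard equal-power capacity expression log2 det(I + (SNR/N_T) HH^H) from the cited Foschini and Gans reference; the function names, the seed and the default trial count are hypothetical choices made for the example and are not taken from the specification.

```python
import math
import random


def complex_gaussian(rng):
    """One zero-mean, unit-variance complex Gaussian sample (Rayleigh amplitude)."""
    s = math.sqrt(0.5)
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))


def determinant(matrix):
    """Determinant of a small square complex matrix by Gaussian elimination."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0 + 0.0j
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) == 0.0:
            return 0j
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return det


def frame_capacity(h, snr_linear):
    """log2 det(I + (SNR / N_T) H H^H) in bps/Hz for one channel realisation."""
    n_r, n_t = len(h), len(h[0])
    scale = snr_linear / n_t
    gram = [[(1.0 if i == j else 0.0)
             + scale * sum(h[i][k] * h[j][k].conjugate() for k in range(n_t))
             for j in range(n_r)]
            for i in range(n_r)]
    # The matrix is Hermitian positive definite, so its determinant is real.
    return math.log2(determinant(gram).real)


def outage_capacity(n_t, n_r, snr_db, trials=10000, outage=0.10, seed=1):
    """Capacity exceeded for (1 - outage) of the random channel realisations."""
    rng = random.Random(seed)
    snr = 10.0 ** (snr_db / 10.0)
    capacities = sorted(
        frame_capacity([[complex_gaussian(rng) for _ in range(n_t)]
                        for _ in range(n_r)], snr)
        for _ in range(trials))
    return capacities[int(outage * trials)]
```

Calling outage_capacity(4, 4, 10.0) and outage_capacity(1, 1, 10.0) gives one point each on curves of the kind plotted in Figure 3; the exact values depend on the seed and trial count.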

There are large 'outage capacity' increases for N_T = N_R = N = 1, 2 and 4. This is due to the
combined effects of transmitter/receiver diversity (which draws the 'worst cases' closer to the
median cases), receiver SNR power gain, and parallelism. These increases can be
considered as a capacity increase at a given receiver SNR, or conversely a reduction in
required SNR for a given link capacity. Note that the case of (N_T, N_R) = (1,4) is much better
than (1,1) due to the extra SNR gain and diversity, even though there is no parallelism to be
exploited with (1,4). However, there are still additional benefits in going to (4,4) in terms of
diversity and parallelism (offering an additional multiplicative factor).

It is commonly accepted that the capacity of CDMA cellular systems is inversely proportional
to the working E_b/N_0 of the subscribers. In the reverse-link case, it is the received E_b which
must be minimised, whereas for the forward-link our aim is to minimise the transmitted E_b
(where E_b denotes energy per information bit). Thus the per-link throughput increases as a
logarithmic function of link E_b (in the manner prescribed by Shannon, 'Communication in the
Presence of Noise', Proc. IRE, vol. 37, pp. 10-21, Jan. 1949), whereas the number of links
which can be supported is in inverse proportion to E_b. If we consider the overall network
capacity to be the product of the per-link capacity and number of links, then we obtain the
characteristic shown in Figure 4. This shows that systems with larger numbers of links, each
working at a lower E_b/N_0, and hence lower throughput, will have a larger network capacity
(i.e. higher spectral efficiency) than systems working at a higher link E_b/N_0 but with fewer
links. This is independent of any statistical multiplexing benefits which apply for large
numbers of users, and this further favours systems with large numbers of low-SNR/low-throughput
links. This analysis relies upon the assumption that the system is not bandwidth
limited (i.e. in the case of the CDMA forward link this requires that there be a plentiful supply
of Walsh codes).
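The product relationship described above may be illustrated with a short numerical sketch. It assumes, purely for the example, that the logarithmic per-link dependence takes the form log2(1 + x) with x the linear E_b/N_0, and that the number of links is a hypothetical interference budget divided by x; neither the function names nor the budget value come from the specification.

```python
import math


def per_link_throughput(ebn0_linear):
    """Assumed logarithmic (Shannon-like) dependence of throughput on link E_b/N_0."""
    return math.log2(1.0 + ebn0_linear)


def supported_links(ebn0_linear, budget=100.0):
    """Number of links in inverse proportion to E_b/N_0 (budget is hypothetical)."""
    return budget / ebn0_linear


def network_capacity(ebn0_db, budget=100.0):
    """Product of per-link throughput and number of supported links."""
    x = 10.0 ** (ebn0_db / 10.0)
    return per_link_throughput(x) * supported_links(x, budget)
```

Evaluating network_capacity at, say, -5, 0, 5 and 10 dB gives a monotonically falling sequence, which is the qualitative behaviour attributed to Figure 4: many low-E_b/N_0 links outperform a few high-E_b/N_0 links when the system is interference limited.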






It can therefore be deduced that, for a CDMA network which is severely interference-limited,
every 3dB reduction in E_b/N_0 at the same link throughput and error rate provides a 2-fold
capacity benefit. For other system scenarios, for example fixed directional-antenna systems
such as Nortel Networks' Fixed Wireless Access product Proximity II, or other
mobile/nomadic or fixed-wireless applications where there are multiple diversity antennas at
the terminal, the arguments are reversed. The interference level is low, but the number of
frequency channels/timeslots (TDMA) or orthogonal Walsh codes (CDMA) is limited. System
capacity should be enhanced by using more bandwidth-efficient modulation, that is,
modulation with a higher spectral efficiency in terms of bps/Hz. In the case of bandwidth-limited
systems, every 2-fold increase in coder/modulation spectral efficiency at the same
error rate provides a 2-fold capacity benefit. In terms of the capacity vs. SNR curves of
Figure 3, there is a goal to increase the bps/Hz axis.

Two major applications are currently envisaged for STC technology: (i) Low mobility or
nomadic high rate wireless terminals (i.e. laptop or Personal Digital Assistant (PDA)
versions) which could be expected to operate both outdoors and indoors; (ii) Fixed terminals
deploying indoor or outdoor antennas. The propagation channel will differ significantly
according to the application scenario in terms of the multipath angle spread and time
dispersion. The angle spread phenomenon, in particular, will have significant impact on the
link performance gain using Space-Time Coding. The terminal antenna design and its
interaction with the surrounding propagation environment also influence the attainable
performance gain of the STC configuration. It is envisaged that STC technology will be
applied to a range of possible systems including EDGE, 3rd Generation WCDMA, future fixed
wireless access, and 4th Generation systems.



Several key implementation issues influence the achievable performance gain using STC.
Since STC capitalises on the inherent parallelism in the spatial channel, one possible risk
concerns the substantial increase in DSP load needed to estimate the multiplicity of channel
impulse responses at the terminal.

The fundamental principle of STC is illustrated in Figure 1. Coded data symbols are
transmitted simultaneously from multiple antennas. This simultaneous transmission of
modulation symbols from different antennas is termed a Space-Time Symbol (STS). Since
modulation symbols can conventionally be represented by a complex number, an STS can
be represented by a vector of complex numbers, with the number of complex elements in the
vector equal to the number of transmit antennas. In a Frequency Division Multiple
Access/Time Division Multiple Access (FDMA/TDMA) system these simultaneous symbols
use the same carrier frequency and same symboling waveform. In a CDMA system an
identical symboling waveform, in the form of an identical spreading code (i.e. Walsh code), is
also used.



Assuming a non-dispersive channel, the receiver simultaneously detects all of the elements
of a transmitted STS using a single symbol-matched filter per receiver antenna. These
detection outputs are built up into a (vector) detection statistic. Thus Figure 1 shows that
every element of this vectorial receiver detection statistic is a superposition of the multiple
simultaneous transmissions, as seen at each receiver antenna. This cross-coupling between
antenna transmissions can be thought of as a form of intersymbol interference (ISI). Each
element, viewed over many STS, will thus resemble a somewhat disorganised Quadrature
Amplitude Modulated (QAM) type constellation (in the noise-free case), where the exact form
of this constellation depends on the STC encoder structure, STC modulation alphabet and
instantaneous channel.

In the case of multiple receive antennas the transmissions from individual transmit antennas,
however, may not be totally (spatially) separated at the receiver. This spatial-ISI can be
overcome in a number of ways, including: by a form of antenna spatial 'nulling' (analogous
to linear 'zero-forcing' time-domain equalisation) or spatial Minimum Mean-Squared Error
(MMSE) beamforming (analogous to linear MMSE time-domain equalisation); or, subtraction
(analogous to Decision Feedback time-domain equalisation); or, maximum-likelihood
sequence estimation (MLSE) of coded Space-Time Symbols (and hence information bits)
from all transmit antennas, based on observations of the vectorial receiver decision statistic
over the whole coded frame.
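As a minimal sketch of the first two options listed above, the following example forms spatial zero-forcing and MMSE weight matrices for a two-transmit, two-receive antenna link. It assumes unit-energy symbols, a known 2 x 2 channel matrix and the textbook expressions W = H^-1 (zero-forcing) and W = (H^H H + sigma^2 I)^-1 H^H (MMSE); the function names are hypothetical and the example is not taken from the specification.

```python
def inverse_2x2(m):
    """Closed-form inverse of a 2 x 2 complex matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def hermitian(m):
    """Conjugate transpose."""
    return [[m[j][i].conjugate() for j in range(len(m))] for i in range(len(m[0]))]


def matmul(a, b):
    """Plain matrix product of two lists-of-lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def zero_forcing_weights(h):
    """Spatial 'nulling': invert the channel so cross-coupling is removed exactly."""
    return inverse_2x2(h)


def mmse_weights(h, noise_var):
    """Spatial MMSE: (H^H H + noise_var * I)^-1 H^H, assuming unit symbol energy."""
    hh = hermitian(h)
    gram = matmul(hh, h)
    gram[0][0] += noise_var
    gram[1][1] += noise_var
    return matmul(inverse_2x2(gram), hh)


def separate(weights, y):
    """Apply a weight matrix to the received vector to estimate the transmitted STS."""
    return [sum(weights[i][k] * y[k] for k in range(len(y)))
            for i in range(len(weights))]
```

With noise_var set to zero the MMSE weights reduce to the zero-forcing weights; for non-zero noise the MMSE solution trades a little residual spatial-ISI for less noise enhancement.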



Using conventional methods (and with a bounded receiver complexity), waveform design is
inevitably a compromise between bandwidth efficiency and power efficiency. However
apparently good STC systems can seemingly 'side-step' the above barrier (Shannon bound)
by achieving increased bandwidth efficiency with little or no reduction in power efficiency.
In principle, STC transmissions could be successfully received in a link with only a single
receiver antenna. However, the power efficiency would be very poor, because it would not
be possible to capitalise on the additional SNR and diversity gain provided by a multiple-antenna
terminal design, and due to the lack of multiple parallel decoupled 'data pipes'. Put
more simply, and referring again to Figure 1 (assuming only a single receiver antenna), the
receiver detection statistic would only have a single element and it would resemble an
extremely high-order QAM constellation (but which changes both in size and shape as the
channel fades at the Doppler rate). This high-order constellation would have a large number
of constellation points, all lying in the same 1-dimensional complex space. Hence, like QAM,
the power efficiency of such a link would be rather poor. In the limit, two constellation points
may even lie on top of one another, and the receiver would require redundancy (memory) in
the coding in order to separate them.
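The single-receiver-antenna case can be made concrete with a small sketch that enumerates the noise-free detection statistic for every possible STS. QPSK symbols and the particular channel gains used in the usage note are assumptions made for the example; the function names are hypothetical.

```python
import itertools

# Assumed QPSK alphabet (unnormalised) for the illustration.
QPSK = [1 + 1j, -1 + 1j, -1 - 1j, 1 - 1j]


def received_points(h_row):
    """Noise-free output of ONE receive antenna for every possible STS.

    h_row holds the complex gain from each transmit antenna to that antenna.
    """
    return [sum(h * s for h, s in zip(h_row, sts))
            for sts in itertools.product(QPSK, repeat=len(h_row))]


def count_distinct(points, tol=1e-9):
    """Number of constellation points that do not lie on top of one another."""
    kept = []
    for p in points:
        if all(abs(p - q) > tol for q in kept):
            kept.append(p)
    return len(kept)
```

With two transmit antennas and gains such as [1, 0.4 + 0.3j] all 16 points are distinct, forming a 16-point QAM-like constellation. With equal gains [1, 1] only 9 distinct points remain, so some pairs of STS are indistinguishable from a single observation, which is the limiting case described above.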

There are, of course, other proposed increased capacity transmission and reception
techniques besides STC, some of which are alternatives to STC, while others may be
complementary. One possible technique is 'fixed beamforming' (FB), which in the case of the
BTS implies a higher order of sectorisation. FB can actually be complementary to STC,
although if both techniques were to be used simultaneously, the number of antenna
elements required would increase multiplicatively.

Figure 2 illustrates a possible range of applications for Space-Time Coding technology: low
mobility applications, which include pedestrian and nomadic terminals - where terminals
would be in the form of PDAs or some other form of personal communicators and laptops,
which would be expected to operate in both outdoor and indoor environments; in-building
wireless LAN, where the propagation channels often have a quite limited delay spread but
would typically exhibit very rich multipath (angle) scatter; wideband fixed wireless access
(e.g. Multi-channel Multi-point Distribution Service [MMDS]), which involves Space-Time Coding
and adaptive antenna processing at the subscriber and could be exploited to provide
enhanced link SNIR margin to increase system capacity and enable simpler installations
using indoor (non-directional) antennas; and in-vehicle, high mobility applications.

In many cases, however, the capacity limits of both interference and bandwidth (Walsh
code) have been reached, thus leaving no conventional means of increasing capacity.

It will be noted from the above that the signal seen at each receive antenna is a
superposition of the multiple simultaneous transmissions from each transmit antenna. With
reference to the channel model of Figure 11 (and as described in V. Tarokh, N. Seshadri
and A. Calderbank, "Space-time codes for high data rate wireless communication:
Performance criterion and Code Construction," IEEE Trans. Inform. Theory 44(2):744-764,
March 1998), at each symbol interval l the transmitted vector x = (x_1, x_2, ..., x_t) is received
according to y[l] = H[l]x[l] + n[l], where y[l] is a sequence of received r-vectors, x[l] is a
sequence of transmitted t-vectors, n[l] is a sequence of noise vectors and H is an r x t matrix
representing the channel coefficients.

Accurate channel estimation (i.e. estimation of the complex gains between each
transmit/receive antenna pair) is important for reliable decoding of space-time codes. This is
because the maximum likelihood branch metric M(x, y) for a received vector y and
hypothesised transmit symbol x is given by M(x, y) = ||y - Hx||^2.

Thus in a space-time decoder it is necessary to calculate the Euclidean distance between 
the received vector and what would have been received if x was transmitted. It is only 
possible to find out what would have been received if the channel gains are known. 
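The role of the channel gains in the branch metric can be sketched as follows. The example builds a received vector from the model y = Hx + n and then evaluates ||y - Hx||^2 for every hypothesised QPSK transmit vector; the exhaustive search stands in for a single trellis step and is an assumption of the example, as are the function names and the QPSK alphabet.

```python
import itertools

# Assumed QPSK alphabet (unnormalised) for the illustration.
QPSK = [1 + 1j, -1 + 1j, -1 - 1j, 1 - 1j]


def channel_output(h, x, noise):
    """y = H x + n for one symbol interval (h is r x t, x has t entries)."""
    return [sum(h[i][j] * x[j] for j in range(len(x))) + noise[i]
            for i in range(len(h))]


def branch_metric(h, x, y):
    """Squared Euclidean distance between y and what x would have produced."""
    predicted = [sum(h[i][j] * x[j] for j in range(len(x))) for i in range(len(h))]
    return sum(abs(yi - pi) ** 2 for yi, pi in zip(y, predicted))


def ml_detect(h, y):
    """Hypothesised transmit vector with the smallest branch metric."""
    t = len(h[0])
    return min(itertools.product(QPSK, repeat=t),
               key=lambda x: branch_metric(h, x, y))
```

Note that both branch_metric and ml_detect take the channel matrix as an argument: if an inaccurate estimate is supplied in place of the true gains, the predicted vectors are displaced and the metrics are inflated, which is the effect discussed in the following paragraphs.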

The maximum likelihood sequence estimator (implemented by the Viterbi algorithm) outputs
the sequence that minimises the sum of such branch metrics over the window of interest.
The channel parameters are required in calculation of these maximum likelihood metrics. If
these parameters are not known exactly, the discrepancy between the parameters used and
the actual channel parameters has an effect similar to having an increased amount of
thermal noise, which degrades performance. This is in contrast to a single antenna additive
noise channel, in which knowledge of the signal amplitude or noise variance is not required
for the Viterbi algorithm (although it is required for the maximum a posteriori [MAP]
algorithm).

Using the analysis given in V. Tarokh, A. Naguib, N. Seshadri and A. Calderbank ("Space-time
codes for high data rate wireless communication: Performance criteria in the presence
of channel estimation errors, mobility and multiple paths," IEEE Trans. Commun. 47(2):199-207,
Feb. 1999), it can be shown that if the channel uncertainty is modelled as an additive
noise term with variance v on each of the channel gains, a loss in coding gain of
approximately

10 log_10(v t γ) dB    (1)

is suffered, where t is the number of transmit antennas and γ = E_s/N_0 is the signal to noise
ratio (SNR). This is especially bad, since the channel estimation noise variance is amplified
by both the number of transmit antennas and the operating SNR.

Since the channel gains H are not known a priori, they must be estimated. The conventional
method is to transmit a known sequence of data called a training sequence, which is used by
the receiver to produce estimates of the channel gains. Since the performance of the
decoding step depends on the accuracy of these channel estimates, a minimum mean
square error estimator (MMSE) may be used. This would minimise the variance of the error
on the channel estimates, which in turn minimises the implementation loss described in
Equation 1.
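As a worked illustration of Equation 1, the following sketch simply evaluates the expression as it is written above for a given estimation error variance, number of transmit antennas and operating SNR. The function name and the example values are hypothetical, and the sketch takes the form of Equation 1 at face value rather than re-deriving it from the cited reference.

```python
import math


def coding_gain_loss_db(v, t, snr_db):
    """Equation 1 as stated in the text: 10 log10(v * t * gamma) in dB.

    v      -- variance of the estimation error on each channel gain
    t      -- number of transmit antennas
    snr_db -- operating SNR (gamma) expressed in dB
    """
    gamma = 10.0 ** (snr_db / 10.0)
    return 10.0 * math.log10(v * t * gamma)
```

For example, with v = 0.1, t = 4 and an SNR of 10 dB the product v t γ equals 4, giving about 6.0 dB. Doubling the number of transmit antennas, or raising the SNR by 3 dB, adds a further 3 dB for the same v, which shows how the estimation error is amplified by both quantities.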

The typical behaviour of MMSE estimators is that their error variance decreases as the
inverse of the length of the training sequence. In other words, approximately twice as many
training symbols are needed in order to halve the mean squared error in the channel
estimates. Thus the length of the training sequences has an impact on achievable system
performance.
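The inverse dependence on training length can be checked numerically with a short sketch. For simplicity it estimates a single complex gain from a known unit-energy training sequence using a least-squares (correlate and normalise) estimator, which is used here as a stand-in for the MMSE estimator; the channel value, noise variance, seed and function names are all assumptions of the example.

```python
import math
import random


def estimate_gain(h, training, noise_var, rng):
    """Least-squares estimate of one complex gain from known training symbols."""
    s = math.sqrt(noise_var / 2.0)
    numerator = 0j
    energy = 0.0
    for symbol in training:
        received = h * symbol + complex(rng.gauss(0.0, s), rng.gauss(0.0, s))
        numerator += received * symbol.conjugate()  # strip the known modulation
        energy += abs(symbol) ** 2
    return numerator / energy


def mean_squared_error(length, noise_var=0.5, trials=4000, seed=7):
    """Monte Carlo mean squared estimation error for a given training length."""
    rng = random.Random(seed)
    h = 0.8 - 0.6j  # hypothetical true channel gain
    training = [(1 + 0j) if k % 2 == 0 else (-1 + 0j) for k in range(length)]
    total = sum(abs(estimate_gain(h, training, noise_var, rng) - h) ** 2
                for _ in range(trials))
    return total / trials
```

For this estimator the expected error is noise_var / length, so mean_squared_error(16) should come out at roughly half of mean_squared_error(8), subject to Monte Carlo scatter.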



The use of long training sequences, however, goes against the main goal of space-time 
coding, which as discussed above, is to provide high-rate, spectrally efficient 
communication. Such training sequences use up power and bandwidth which could instead 
have been used to transmit data. 

In the cdma2000 system, for example, pilot sequences transmitted on orthogonal carriers
may be used for the purposes of downlink channel estimation. While the use of pilot
sequences appears not to affect the spectral efficiency (since the amount of data transmitted
in a packet is unchanged), closer examination indicates that this is not the case. An
additional pilot per transmit antenna is required, which increases the total transmit power at
the base-station. Also, additional Walsh codes must be dedicated to these pilots.
Furthermore, the pilot channels produce interference for users in other cells.



It is therefore desirable to reduce the amount of training symbols and/or the total pilot power
required, without affecting the performance of the decoder. This in turn will provide
increased capacity.



OBJECT OF THE INVENTION 

The present invention seeks to provide an improved space-time coding scheme and an
improved channel estimation scheme. The present invention further seeks to provide a
space-time coding scheme having an increased spectral efficiency. A still further object of
the invention is to provide a space-time coding scheme having improved
bandwidth and power efficiency.

SUMMARY OF THE INVENTION 

In accordance with a first aspect of the invention there is provided space-time coding
apparatus comprising a data input, a plurality of signal outputs couplable to a respective
plurality of transmit antennas, a trellis encoder implemented using a convolutional encoder
and arranged to receive data from the data input and to produce encoded data in the form of
code symbols, a modulator arranged to map the encoded data to predetermined modulation
symbols (e.g. QPSK), and a de-multiplexer arranged to de-multiplex the modulation symbols
to the signal outputs.

Preferably the input receives binary data and the convolutional encoder groups sequentially
input data into four-bit groups, which are processed by the encoder to provide eight-bit data
groups which are subsequently converted to QPSK symbols.

Preferably the trellis encoder comprises a convolutional encoder having a shift register with
two parts operable to generate two code signals for each data bit input to the trellis encoder,
wherein the code signals are encoded by a generator function, wherein the two coded
signals are switched to an output of the encoder.

Preferably the set of signal outputs comprises channelisers, radio frequency mixers, and
amplifiers, and may also comprise radio frequency transmit antennas.

The space-time coding apparatus may be arranged to provide multidimensional space-time 
codes, wherein multiple space-time symbols are output for every trellis transition. 
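The chain of trellis encoder, modulator and de-multiplexer set out in this aspect can be sketched in a few lines. The sketch assumes a rate-1/2 convolutional code with generator polynomials 7 and 5 (octal) and constraint length 3, a Gray-mapped QPSK alphabet and round-robin de-multiplexing; none of these particular choices is taken from the specification, which does not fix them at this point, and all names are hypothetical.

```python
# Assumed generator polynomials (7, 5 octal) for a constraint-length-3 code.
G1, G2 = 0b111, 0b101

# Assumed Gray mapping from pairs of coded bits to QPSK symbols.
QPSK_MAP = {(0, 0): 1 + 1j, (0, 1): -1 + 1j, (1, 1): -1 - 1j, (1, 0): 1 - 1j}


def convolutional_encode(bits):
    """Produce two coded bits for every data bit shifted into the register."""
    state = 0
    coded = []
    for bit in bits:
        state = ((state << 1) | bit) & 0b111
        coded.append(bin(state & G1).count("1") % 2)
        coded.append(bin(state & G2).count("1") % 2)
    return coded


def qpsk_modulate(coded):
    """Map consecutive pairs of coded bits to QPSK symbols."""
    return [QPSK_MAP[(coded[i], coded[i + 1])] for i in range(0, len(coded), 2)]


def demultiplex(symbols, n_antennas):
    """Switch successive modulation symbols to successive signal outputs."""
    return [symbols[a::n_antennas] for a in range(n_antennas)]


def space_time_encode(bits, n_antennas=4):
    """Data bits -> coded bits -> QPSK symbols -> one stream per antenna."""
    return demultiplex(qpsk_modulate(convolutional_encode(bits)), n_antennas)
```

With four outputs, each group of four input bits yields eight coded bits and hence four QPSK symbols, one per antenna, which together form a single Space-Time Symbol. This mirrors the four-bit to eight-bit grouping mentioned above.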

In accordance with a second aspect of the present invention there is provided a method of
space-time encoding comprising receiving a stream of data at an input, trellis encoding the
data stream using a convolutional encoder operable to sequentially group data to provide
coded bits, outputting the encoded data to a modulator, operating the modulator to map the
encoded signals to QPSK symbols to provide modulated signals, and operating a
de-multiplexer to switch the modulated signals to a set of signal outputs.

Preferably the input receives binary data and the convolutional encoder groups sequentially
input data into four-bit groups, which are processed by the encoder to provide eight-bit data
groups which are subsequently converted to QPSK symbols.

Preferably the trellis encoder comprises a convolutional encoder having a shift register with
two parts operable to generate two code signals for each data bit input to the trellis encoder,
wherein the code signals are encoded by a generator function, wherein the two coded
signals are switched to an output of the encoder.

In accordance with a third aspect, the invention provides a method of space-time encoding a
data stream comprising the steps of trellis encoding a data stream using an optimal binary
convolutional code of predetermined constraint length, modulating the encoded data by
mapping the encoded data stream to modulation symbols selected from a predetermined
modulation alphabet, and de-multiplexing the modulation symbols to a plurality of transmit
antennas.

In accordance with a fourth aspect, the invention provides a method of estimating complex
channel gain in a space-time communications system comprising the steps of generating an
initial, relatively coarse estimate of channel gain, receiving space-time encoded information
symbols over the channels, decoding the information symbols using the initial channel
estimate, to produce a sequence of coded symbol probabilities, calculating a sequence of
coded symbol averages based on the coded symbol probabilities, refining the channel
estimate for each channel, by processing the received symbols to remove the expected
effect of the transmissions carried by all the other channels by performing soft cancellation
using the relevant parts of the sequence of symbol averages, and to remove the expected
effect of modulation on each symbol, and by averaging the channel estimates over all
symbols for each respective channel to produce a refined estimate for each channel,
decoding the information symbols again using the refined channel estimate, to produce a
refined sequence of coded symbol probabilities, repeating the calculating, refining and
decoding steps until convergence, and decoding the information symbols using the final
channel estimate and using hard decisions.
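The sequence of steps in this aspect can be illustrated with a heavily simplified sketch. It assumes uncoded BPSK symbols and replaces the space-time decoder with a per-symbol soft demodulator that computes symbol probabilities directly from the branch metrics; a practical receiver would use the probabilities delivered by the trellis decoder instead. The alphabet, the normalisation by the sum of squared symbol averages, the convergence test and all function names are assumptions of the example.

```python
import itertools
import math
import random

BPSK = (1.0, -1.0)  # assumed alphabet for the illustration


def simulate(h, n_symbols, noise_var, seed=3):
    """Random BPSK vectors sent through y = H x + n (helper for experiments)."""
    rng = random.Random(seed)
    s = math.sqrt(noise_var / 2.0)
    sent = [[rng.choice(BPSK) for _ in h[0]] for _ in range(n_symbols)]
    received = [[sum(h[i][j] * x[j] for j in range(len(x)))
                 + complex(rng.gauss(0.0, s), rng.gauss(0.0, s))
                 for i in range(len(h))] for x in sent]
    return sent, received


def symbol_averages(h_est, received, noise_var):
    """Stand-in for soft decoding: mean symbol per transmit antenna and interval."""
    t = len(h_est[0])
    candidates = list(itertools.product(BPSK, repeat=t))
    averages = []
    for y in received:
        metrics = [sum(abs(y[i] - sum(h_est[i][j] * x[j] for j in range(t))) ** 2
                       for i in range(len(y))) for x in candidates]
        best = min(metrics)  # subtracting the minimum avoids underflow in exp()
        weights = [math.exp(-(m - best) / noise_var) for m in metrics]
        total = sum(weights)
        averages.append([sum(w * x[j] for w, x in zip(weights, candidates)) / total
                         for j in range(t)])
    return averages


def refine(h_est, received, averages):
    """Soft-cancel the other antennas, strip the modulation, average over symbols."""
    r, t = len(h_est), len(h_est[0])
    refined = [[0j] * t for _ in range(r)]
    for i in range(r):
        for j in range(t):
            num, den = 0j, 0.0
            for y, xbar in zip(received, averages):
                residual = y[i] - sum(h_est[i][k] * xbar[k] for k in range(t) if k != j)
                num += residual * xbar[j]  # BPSK averages are real, no conjugate needed
                den += xbar[j] ** 2
            refined[i][j] = num / den
    return refined


def joint_estimate(h_init, received, noise_var, max_iter=20, tol=1e-6):
    """Iterate soft decoding and refinement, then take hard decisions."""
    h_est = h_init
    for _ in range(max_iter):
        h_new = refine(h_est, received, symbol_averages(h_est, received, noise_var))
        change = max(abs(h_new[i][j] - h_est[i][j])
                     for i in range(len(h_est)) for j in range(len(h_est[0])))
        h_est = h_new
        if change < tol:
            break
    hard = [[1.0 if a >= 0.0 else -1.0 for a in xbar]
            for xbar in symbol_averages(h_est, received, noise_var)]
    return h_est, hard
```

In use, a coarse initial estimate (for instance one obtained from a very short training sequence) is passed to joint_estimate together with the received vectors; the function returns the refined channel matrix and the hard symbol decisions made with it.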

The invention also provides in another aspect, a computer program which, when executing
on suitably configured hardware, causes the hardware to perform the steps of the above
method.



In a further aspect, the invention provides channel estimation apparatus for a space-time
communications system comprising an initial estimator operable to generate an initial,
relatively coarse estimate of channel gain, a receiver operable to receive space-time
encoded information symbols, a decoder operable to produce a sequence of coded symbol
probabilities by decoding received information symbols using a channel estimate and to
decode the information symbols using a final channel estimate using hard decisions,
averaging means operable to calculate a sequence of coded symbol averages based on the
coded symbol probabilities, and estimate refining means operable to refine the channel
estimate for each channel, by processing the received symbols to remove the expected
effect of the transmissions carried by all the other channels by performing soft cancellation
using the relevant parts of the sequence of symbol averages, and to remove the expected
effect of modulation on each symbol, and by averaging the channel estimates over all
symbols for each respective channel to produce a refined estimate for each channel.

In accordance with a further aspect of the present invention there is provided a
communications system wherein space-time coding is employed by the system including
transmitting apparatus and receiver apparatus wherein the transmitter apparatus comprises
an input, a trellis encoder, a de-multiplexer, and a set of signal outputs wherein the input is
operable to receive a stream of data; the trellis encoder is operable to encode the data and
to output encoded data to the de-multiplexer, which de-multiplexer
switches signals to the set of signal outputs; and wherein the trellis encoder comprises a
convolutional encoder operable to sequentially group data to provide coded data, which
encoded data is mapped to provide modulated symbols.

In another aspect, the invention provides a communications system arranged to use
space-time coding, and comprising transmitter apparatus and receiver apparatus, wherein the
receiver apparatus includes an initial estimator operable to generate an initial, relatively
coarse estimate of channel gain, a receiver operable to receive space-time encoded
information symbols, a decoder operable to produce a sequence of coded symbol estimates
using a channel estimate, and estimate refining means operable to refine the channel
estimate for each channel, by processing the received symbols to remove the expected
effect of the transmissions carried by all the other channels by performing cancellation using
the relevant parts of the sequence of symbol estimates, and to remove the expected effect of
modulation on each symbol, and by averaging the channel estimates over all symbols for
each respective channel to produce a refined estimate for each channel, the apparatus
being arranged to iteratively repeat the refining of the channel estimation and the production
of symbol estimate functions until convergence of the channel estimates occurs.

In accordance with a still further aspect of the present invention there is provided a computer 
processing chip operable to encode at least a stream of data and which is operable to output 
encoded data wherein the chip comprises a convolutional encoder operable to sequentially 
group data to provide coded bits to provide modulated symbols. 

In accordance with a still further aspect of the present invention there is provided software
preferably stored on a machine readable medium, which is operable to cause suitable
hardware to encode at least a stream of data and which is operable to output encoded data
wherein the software is programmed to function as a convolutional encoder operable to
sequentially group data to provide coded bits, which coded bits are mapped to provide
modulated symbols.

A Modified Convolutional Space-Time Trellis Coder (MC-STTC) has thus been developed
which compares well against other known Space-Time Coding algorithms. This new design
(described in detail below) provides an impressive coding gain and demonstrates that
substantial performance gains are possible compared to radio links based on conventional
antenna configurations (i.e. those that do not exploit Space-Time Coding).

Simulations indicate that a 256-state MC-STTC algorithm using four antennas at both the
transmitter and receiver can provide a 12.5dB improvement in power efficiency over a single
transmit antenna, single receive antenna, coded baseline configuration under quasi-static,
flat (uncorrelated) fading. At the same time the MC-STTC arrangement provides a four-fold
improvement in spectral efficiency by virtue of the parallel transmission. The MC-STTC
arrangement also provides an 8.7dB power efficiency gain over a transmit diversity baseline
implementing an STTD block code concatenated with a k=9 binary convolutional encoder,
when operating under the same flat fading conditions.

An analysis of the DSP complexity of these new STC schemes has shown that the 256-state
MC-STTC algorithm requires approximately 2.2 times the processing power of the two
benchmark configurations in relation to the demodulator soft metric processing and Viterbi
maximum likelihood sequence estimate (MLSE) processing.



In the cases where the limits of interference and bandwidth are approached, the required
SNR of the transmission scheme employed should be reduced and the spectral efficiency
should be increased. Through a combination of terminal antenna diversity, coupled with
Space-Time Coding (assuming suitable code design and appropriate channel behaviour),
the above system design objectives can be met.



BRIEF DESCRIPTION OF THE DRAWINGS 

In order that the present invention is more fully understood and to show how the same may
be carried into effect, reference shall now be made, by way of example only, to the figures
and tables as shown in the accompanying drawing sheets, in which:-
Figure 1 shows a system model of space-time coding;
Figure 2 shows STC target applications;
Figure 3 shows a 10% outage capacity (bps/Hz) with correlated fading;
Figure 4 shows relative capacity vs. per-link SNR for a CDMA network;
Figure 5 shows candidate approaches to space-time coding;
Figure 6 shows an MC-STTC transmitter and receiver;
Figure 7 shows a convolutional encoder in detail;
Figure 8 shows how present state signals are mapped onto next state signals;
Figure 9 shows a proposed SC-STTC coder/decoder structure;
Figure 10 shows a comparison of spectral efficiency and E_b/N_0 for a number of
coding/decoding schemes;
Figure 11 is a schematic diagram of a typical STC channel model;
Figure 12 is a schematic representation of an exemplary STC communications system;
Figures 13A and 13B are a flowchart showing the joint channel estimating and decoding
technique;
Figure 14 is a plotted simulation result indicating the relationship between pilot power and
decoder performance for a conventional receiver;
Figure 15 shows indicative performance results for a decoder in accordance with the
invention;
Table 1 shows possible code designs for MC-STTC-N for different N, where N=2^M;
Table 2 shows simulated performance of STC algorithms; and
Table 3 shows STC algorithm complexity (at a data rate of 153.6kbps).

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
There will now be described by way of example the best mode contemplated by the
inventors for carrying out the invention. In the following description, numerous specific
details are set out in order to provide a complete understanding of the present invention. It
will be apparent, however, to those skilled in the art that the present invention may be put
into practice with variations of the specific.

In 3G CDMA air interfaces such as the cdma2000 air interface, Space-Time Coding is a
candidate coding technique. The first stage of applying STC is to have the required number
of antennas and receiver chains at the terminal. In the case of a terminal having 4 receiver
chains then there should be an average benefit of 6dB SNR gain (under ideal conditions),
plus some additional diversity gain provided that interference is uncorrelated. If the 4
receiver channels have equal mean power, then the 6dB SNR gain will be achieved
regardless of the degree of correlation between the fading seen at these different antennas.
The diversity gain achieved will be dependent upon the signal fading correlation.
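
The 6dB figure follows from the usual array-gain relationship of 10*log10(N) for N equal-power receiver chains. A minimal sketch of that arithmetic is given below; the function name is purely illustrative and is not taken from the specification.

```python
import math

def array_gain_db(n_rx: int) -> float:
    """Mean SNR (array) gain in dB for n_rx equal-power receiver chains,
    assuming ideal coherent combining and uncorrelated noise."""
    return 10.0 * math.log10(n_rx)

if __name__ == "__main__":
    # Four receiver chains give roughly 6 dB.
    print(f"{array_gain_db(4):.2f} dB")
```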



EDGE is a recently-introduced enhancement to GSM, potentially using a more complex
trellis coding and higher-order modulation format within the familiar GSM burst structure.
Although it is believed not to have been proposed previously, a possible future enhancement
to EDGE could be to replace the currently-proposed Trellis-Coded Modulation (TCM)
technique with an appropriate STC technique. Unlike most proposed 3G mobile CDMA
systems, EDGE maintains orthogonality between users on the reverse link (since no two
users in a given sector will share the same timeslot and carrier frequency). Thus the
potentially high spectral efficiency of STC techniques makes them attractive on the EDGE
reverse link (uplink) as well as the forward link (downlink), whereas for CDMA systems it is
predominantly the latter which is of interest, since the reverse link traditionally uses random
spreading codes. For the case of a flat-fading (non-dispersive) channel, the performance
and complexity of an STC scheme applied to EDGE can be readily analysed. In the event of
a dispersive channel, however, the performance becomes more difficult to analyse and
the best approach in this dispersive case would be to treat the effect of the dispersive
channel as a form of trellis coding and therefore to treat the concatenation of the
Space-Time Coder and channel as a serially-concatenated code. Serially-concatenated codes
are described in more detail below.

The DSP processing complexity of the STC demodulation and decoding processing for a
number of candidate Space-Time Codes is discussed below. The discussion is generic to
various different multiple-access system configurations (e.g. CDMA/TDMA). However, there
are other areas of transmitter and receiver analogue and digital signal processing where
STC may imply additional complexity. This arises directly from the fact that the use of STC
implies greater numbers of transmitters and receivers at each end of the link.

The two best-known Orthogonalised Space-time coding schemes are a) Space-Time
Transmit Diversity (STTD), a special case of STBC, shown in Figure 5, which was first
described by Alamouti, and b) Orthogonal Transmit Diversity (OTD). The principle of both
schemes is to map coded modulation symbols to two transmit antennas. It has been shown
that STTD is superior to OTD (typically around 0.5dB better in terms of Eb/N0), since it maps
all modulated symbols to both transmit antennas, whereas OTD only maps each modulation
symbol to one of the antennas, and relies on the memory within the coder to obtain the
diversity advantage. Note that both STTD and OTD have the desirable property that they
don't expand the bandwidth of the signal, and thus in a CDMA system they don't use up any
more Walsh codes than the baseline. Furthermore, the receiver can process the received
signal using linear processing, avoiding the requirement for more complex a posteriori
probability (APP) detection of the Space-Time Block Code (STBC), shown in Figure 5(i).
Using STTD and OTD it is possible to extract diversity without having to sacrifice spectral
efficiency, since the codes are 'rate 1'.
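
The linear receiver processing mentioned above can be illustrated with the well-known Alamouti two-antenna block code. The following is a minimal sketch only, assuming a single receive antenna, a channel that is constant over the two symbol periods and no noise; the function names are illustrative and do not appear in the specification.

```python
import numpy as np

def sttd_encode(s1, s2):
    """Alamouti/STTD block: rows are symbol periods, columns are transmit antennas."""
    return np.array([[s1, s2],
                     [-np.conj(s2), np.conj(s1)]])

def sttd_combine(r1, r2, h1, h2):
    """Linear combining at one receive antenna.

    r1, r2 are the samples received in the two symbol periods and h1, h2 the
    (assumed known) channel gains from transmit antennas 1 and 2.
    """
    gain = abs(h1) ** 2 + abs(h2) ** 2
    s1_hat = (np.conj(h1) * r1 + h2 * np.conj(r2)) / gain
    s2_hat = (np.conj(h2) * r1 - h1 * np.conj(r2)) / gain
    return s1_hat, s2_hat

if __name__ == "__main__":
    h = np.array([0.7 - 0.2j, -0.3 + 0.9j])          # illustrative channel gains
    s1, s2 = (1 + 1j) / np.sqrt(2), (-1 + 1j) / np.sqrt(2)
    r = sttd_encode(s1, s2) @ h                      # noiseless received samples
    print(sttd_combine(r[0], r[1], h[0], h[1]))
```

Two symbols are sent in two symbol periods, so the code is 'rate 1', and each symbol is recovered with the combined gain of both transmit paths.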
A further approach is the Bell Labs Layered Space-Time Processing (BLAST) approach to
STC (Foschini, G J; 'Layered space-time architecture for wireless communication in a fading
environment when using multi-element antennas', Bell Labs Technical Journal, Vol. 1, No. 2,
Autumn 1996, pp. 41-59). The basic concept behind layered STC is illustrated in Figure
5(ii). The information bits are demultiplexed into individual streams, which are then fed into
individual encoders. These coders may be binary convolutional coders, or even have no
coding at all. The outputs of the coders are modulated and fed to the separate antennas,
from which they are transmitted, using the same carrier-frequency/symbol waveform (TDMA)
or Walsh code (CDMA). At the receiver, a spatial beamforming/nulling (zero-forcing) process
is used at the front end in order to separate the individual coded streams, and feed them to
their individual decoders. The outputs of the decoders are multiplexed back to reconstruct
the estimate of the original information bit-stream.
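
The zero-forcing front end can be sketched as a pseudo-inverse of the channel matrix applied to the received vector. This is an illustrative sketch under the assumptions of a known flat-fading channel with at least as many receive as transmit antennas; the function name and matrix shapes are not from the specification.

```python
import numpy as np

def zero_forcing_separate(y, H):
    """Separate the transmitted streams by nulling mutual interference.

    y: received vector, shape (n_rx,)
    H: channel matrix, shape (n_rx, n_tx), assumed known and full column rank
    Returns one estimate per transmit stream, shape (n_tx,).
    """
    return np.linalg.pinv(H) @ y

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    H = rng.standard_normal((4, 2)) + 1j * rng.standard_normal((4, 2))
    x = np.array([1 + 1j, -1 + 1j]) / np.sqrt(2)     # one QPSK symbol per stream
    print(zero_forcing_separate(H @ x, H))
```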

There are a number of disadvantages to the above zero-forcing receivers, in that some of
the diversity potential of the receiver antenna array is necessarily sacrificed in the
beamforming process. In order to overcome this problem, layering the receiver processing
using Multi-User Detection (MUD) techniques such as subtractive cancellation has been
proposed. The disadvantage with this layered approach, as with all subtractive multi-user
detection schemes, is that wrong detection decisions higher up in the chain can adversely
affect performance lower down. The receiver described in detail below could potentially be
used to improve the performance of a BLAST receiver, for example, by using
soft-cancellation.

Tarokh et al have generalised Alamouti's STTD concept ('Space-Time Block Coding for
Wireless Communications: Theory of Generalised Orthogonal Designs', submitted to IEEE
Trans. Inf. Theo.) to larger numbers of transmit antennas, with their so-called 'Theory of
Generalised Orthogonal Designs', which they claim is based on a well-known matrix problem
in the mathematics literature, the so-called 'Hurwitz-Radon' problem. The authors claim that
for complex modulation constellations there are no generalised orthogonal designs of rate 1
which map all input symbols to all transmit antennas (yet still allowing simple linear
processing at the receiver) for N_T > 2. They do, however, present rate 3/4 and rate 1/2
designs for N_T = 4, for example. This technique is known as Orthogonal Space-Time Block
Coding (OSTBC).



Higher-order orthogonalised designs offer improved diversity, but at the price of a reduced
spectral efficiency, and have yet to offer any intrinsic coding gain. They thus consume
valuable Walsh codes in a CDMA system, or timeslots in a TDMA system. Such designs are
therefore not attractive for applications with multiple receiver (i.e. terminal) antennas, since
system capacity would be predominantly bandwidth-limited rather than interference-limited.



As a way to avoid this spectral efficiency reduction, a hybrid OTD/STTD scheme has
recently been proposed, which uses four transmit antennas, but forms a rate 1 block code.
However, this doesn't achieve full 4-way antenna diversity on every coded symbol. Instead,
it achieves 2-way antenna diversity on every coded symbol (like STTD), and further diversity
is then achieved (up to the full 4-way) through the coding structure (like OTD).



Space-Time Trellis Coding (STTC) is a generalisation of Trellis-Coded Modulation (TCM).
The basic structure of the STTC encoder is illustrated in Figure 5(iii) and the encoder can be
considered in the most general sense as a finite-state state-machine. The value of the latest
group of information bits is used to determine the transitions which will occur between
present state and next state. The result of this transition is the transmission of what is known
as a 'Space-Time Symbol' (STS), which (as noted above) is actually equivalent to the
simultaneous transmission of a group of symbols, one from each of the transmit antennas. In
principle, it could be arranged for the encoder to transmit a sequence of Space-Time
Symbols in response to a single trellis transition (branch). This 'multidimensional' approach
would inevitably lead to a reduction in the code's spectral efficiency, but potentially could
improve power efficiency. The constituent symbols of the STS could, for example, be
chosen from any set, such as QPSK, 8-PSK or 16-QAM. However, in the simulations
presented later in this section only the QPSK case is considered.

The operation of the STTC decoder described in mathematical terms above will now be 
further explained. 

The decoder employs a Viterbi algorithm, and thus is necessarily somewhat complex. The
task of the STTC decoder is to track the progress of the encoder state machine, based on
the observable signal received on (potentially) a multiplicity of receiver antennas. The usual
approach (for a non-concatenated STTC) is to employ the Viterbi algorithm in order to form
the maximum-likelihood ('hard') estimate of the original information sequence. The Viterbi
algorithm requires at its input, for each 'branch' of the decoding (i.e. for each transition from
present state to next state), a set of log-likelihood ratios (LLR) for the set of all possible
hypotheses of the received STS. This set of LLRs is generated (as discussed above) by the
preceding 'demodulator' stage based on squared Euclidean distances (in the N_R N_L-ary
complex vector space of the receiver baseband processor, where N_L is the number of
time-dispersed paths, assuming a spread-spectrum waveform) between the hypothesised
received symbol (for the trellis branch under consideration) and the actual received signal.
As discussed in Massey ('The how and why of channel coding', Proc. International Zurich
Seminar on Digital Communications, pp. 67-73, 6-8 March 1984), the 'modulator' must be a
memoryless device, and consequently the demodulator is defined as a device to 'extract
likelihood information about the possible transmitted signal in every symbol period'. This
likelihood information is subsequently passed to the decoder. Note that as the channel
changes (due to Doppler), this set of hypotheses for the received STS will also change.
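
The demodulator stage described above can be sketched as follows for a flat-fading channel (a single time-resolvable path). This is a minimal illustration, not the specification's implementation: the function names, array shapes and the use of a known channel matrix H are assumptions. Under additive white Gaussian noise the log-likelihood of each hypothesis is, up to a constant and a scale factor, the negative of the squared distance computed here.

```python
import itertools
import numpy as np

def all_sts_hypotheses(constellation, n_tx):
    """Enumerate every possible Space-Time Symbol: one constellation point per antenna."""
    return np.array(list(itertools.product(constellation, repeat=n_tx)))

def branch_metrics(r, H, hypotheses):
    """Squared Euclidean distance between the received vector and each hypothesis.

    r: received vector for one STS period, shape (n_rx,)
    H: channel matrix, shape (n_rx, n_tx) (assumed known or estimated)
    hypotheses: candidate STS values, shape (n_hyp, n_tx)
    """
    expected = hypotheses @ H.T              # hypothesised received vectors, (n_hyp, n_rx)
    return np.sum(np.abs(r - expected) ** 2, axis=1)

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    qpsk = np.exp(2j * np.pi * np.arange(4) / 4)
    hyp = all_sts_hypotheses(qpsk, n_tx=2)
    H = rng.standard_normal((2, 2)) + 1j * rng.standard_normal((2, 2))
    r = H @ hyp[5]                           # noiseless reception of hypothesis 5
    print(int(np.argmin(branch_metrics(r, H, hyp))))
```

Because the hypothesised received vectors depend on H, they must be recomputed whenever the channel estimate changes, which is the Doppler-dependent cost noted in the text.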

It can be seen that the complexity of such a maximum-likelihood receiver is therefore a 
function of the number of states in the trellis, the size of the set of possible STS (i.e. the set 
of possible received-STS hypotheses), the number of branches leaving a state (which also 
determines the code's spectral efficiency), the number of receiver antennas, N_R, the number
of time-resolvable paths, N_L, and the Doppler rate of the channel (which affects the rate of
update of the hypotheses). 
STTC codes have been analysed and heuristics to design STTCs have been devised
(Tarokh V., Seshadri N. and Calderbank A.R.; 'Space-Time Codes for High Data Rate
Wireless Communications: Performance Criterion and Code Construction', IEEE
Transactions on Information Theory, Vol. 44, March 1998, pp. 744-765) which are believed
by some to be optimal (for a given number of states), and optimal STTC codes for N_T = 2 are
employed in comparisons below.

In the embodiments of the invention described below, a class of codes, conveniently termed
'Modified-Convolutional Space-Time Trellis Codes' (MC-STTC), has been devised. The
technique comprises the taking of an optimal binary convolutional code of a given constraint
length, mapping the outputs to QPSK symbols, and de-multiplexing the result to the different
transmit antennas. The MC-STTC technique thus provides for de-multiplexing to take place
after the coding and symbol mapping (modulation). The MC-STTC equivalent trellis
structure can be readily determined in the form of the conventional STTC, and hence
decoded in the same manner. The detailed properties of this MC-STTC class of codes are
not assumed to be optimal for all channel types and antenna configurations, but simulation
results show a promising performance.
With non-concatenated STTC techniques, coding complexity, and hence performance, can
be increased by increasing the number of trellis states. The drawback is that the decoder
complexity also scales linearly with the number of states whereas, based on experience with
conventional convolutional codes, diminishing returns in terms of power efficiency are
expected.




The binary convolutional encoder 900 is shown in greater detail in Figure 7. Incoming binary
data 902, grouped into units of 4 information bits, is input into a shift register 904 having a
constraint length, K, of 9. The rate is one half, with two modulo 2 summers 906, 908
generating two binary coded symbols for each data bit input to the encoder. The code
symbols are output so that the code symbol (c0) encoded with generator function s0 is output
first, and the code symbol (c1) encoded with generator function s1 is output last. The state of
the convolutional encoder, upon initialisation, shall be the all-zero state. The first code
symbol output after initialisation shall be a code symbol encoded with generator function s0.
The generator functions of the code shall be s0 equals 763 (octal) and s1 equals 561 (octal)
as per Table 1 (Source: Odenwalder (1970) and Larsen (1973)).
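
A minimal sketch of such a rate one-half, K = 9 encoder is given below, using the generator values stated above. The tap ordering (most significant generator bit applied to the newest input bit) is an assumption made for illustration, since the figure is not reproduced here, and the function name is hypothetical.

```python
def conv_encode(bits, generators=(0o763, 0o561), K=9):
    """Binary convolutional encoder starting from the all-zero state.

    For every input bit, one coded bit is produced per generator, in the order
    the generators are listed (so c0 from the first generator is output first).
    Assumed convention: the newest bit occupies the most significant register
    position and is tapped by the most significant bit of each generator.
    """
    reg = 0
    out = []
    for b in bits:
        reg = (reg >> 1) | ((b & 1) << (K - 1))        # shift in the new bit
        for g in generators:
            out.append(bin(reg & g).count("1") % 2)    # modulo-2 sum of tapped bits
    return out

if __name__ == "__main__":
    # A unit of 4 information bits yields 8 coded bits.
    print(conv_encode([1, 0, 1, 1]))
```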
Switch 910 switches the outputs from the summers so that 8 coded bits are mapped by
QPSK mapper 914 to four QPSK symbols. Optional unit 912 re-orders bits
pseudo-randomly in groups of 8. The QPSK mapper provides 4 QPSK symbols which are
de-multiplexed by de-multiplexer 916 to provide one space-time symbol (STS) which is
transmitted by all antennas 918-924. The QPSK mapper could provide natural mapping or
Gray mapping. Figure 7 shows an MC-STCC-256-4:4 (since K=9, 2^(K-1) = 2^8 = 256). A receiver
suitable to receive the STTC signals can be constructed as is known for the STTC receiver
described with reference to Figure 5(iii). The present invention provides an improved
coding scheme and is generally applicable to STTC schemes; the invention is not limited to
a particular type of transmitter and receiver group.
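
The mapping of 8 coded bits onto one space-time symbol can be sketched as below. This is illustrative only: the particular Gray bit-to-symbol assignment, the unit-energy scaling and the function name are assumptions, and the optional pseudo-random re-ordering unit is omitted.

```python
import numpy as np

# Assumed Gray mapping: neighbouring constellation points differ in one bit.
GRAY_QPSK = {(0, 0): 1 + 1j, (0, 1): -1 + 1j, (1, 1): -1 - 1j, (1, 0): 1 - 1j}

def bits_to_sts(coded_bits, n_tx=4):
    """Map 2*n_tx coded bits to one space-time symbol (one QPSK symbol per antenna)."""
    if len(coded_bits) != 2 * n_tx:
        raise ValueError("expected two coded bits per transmit antenna")
    pairs = zip(coded_bits[0::2], coded_bits[1::2])
    # Element k of the result is the symbol de-multiplexed to antenna k.
    return np.array([GRAY_QPSK[p] for p in pairs]) / np.sqrt(2)

if __name__ == "__main__":
    print(bits_to_sts([0, 0, 0, 1, 1, 1, 1, 0]))
```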

Figure 8 shows a state diagram for this embodiment. There are 2^4 = 16 transitions leaving
each present state, and so each transition carries 4 bits of information. Each
'branch' or 'edge' of the trellis, between a 'present state' and 'next state', is associated with
transmission (and reception) of a Space-Time Symbol. In this example, an STS consists of
four QPSK symbols transmitted simultaneously, one from each transmitter antenna.



In an alternative embodiment, the convolutional coder could be a quarter rate coder whereby
four information bits generate sixteen coded bits, which in turn provide eight QPSK symbols
to provide two space-time symbols for each input data block (i.e. for each trellis transition).
This is an example of a 'multidimensional' Space-Time Code. In such a way, the MC-STTC
technique can be extended to other rate convolutional coders, other modulation symbol
alphabets and other numbers of outputs (e.g. antennas) to generate any Space-Time code
of desired dimensionality. Indeed we can apply this 'multidimensional' approach to any
Space-Time Trellis Code, whereby multiple STSs are transmitted for every trellis branch
(every state transition of the encoder finite state machine).

Turning to the receiver, the action of the conventional Viterbi receiver is to find the single
survivor (i.e. the most probable) path through the whole trellis, from Start State to End State.
This is achieved by finding the path with the lowest state metric (highest probability) which,
as described above, is found by summing the branch metrics for each branch in the path.
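
The survivor-path search can be sketched generically as follows. This is a simplified illustration rather than the receiver of the embodiment: the trellis is described by a hypothetical `next_state` table, branch metrics are supplied precomputed, and the search ends at whichever final state has the lowest metric rather than at a forced End State.

```python
def viterbi(branch_metric, next_state, start_state=0):
    """Find the lowest-metric path through a trellis.

    branch_metric[t][s][u]: metric of leaving state s with input u at step t
    next_state[s][u]: state reached from state s with input u
    Returns (list of inputs along the survivor path, total path metric).
    """
    n_states, n_inputs = len(next_state), len(next_state[0])
    inf = float("inf")
    metric = [inf] * n_states
    metric[start_state] = 0.0
    history = []
    for step in branch_metric:
        new_metric = [inf] * n_states
        back = [None] * n_states
        for s in range(n_states):
            if metric[s] == inf:
                continue                      # state not yet reachable
            for u in range(n_inputs):
                ns = next_state[s][u]
                m = metric[s] + step[s][u]    # state metric = sum of branch metrics
                if m < new_metric[ns]:
                    new_metric[ns], back[ns] = m, (s, u)
        history.append(back)
        metric = new_metric
    state = min(range(n_states), key=lambda s: metric[s])
    total = metric[state]
    inputs = []
    for back in reversed(history):            # trace the survivor backwards
        state, u = back[state]
        inputs.append(u)
    return inputs[::-1], total

if __name__ == "__main__":
    nxt = [[0, 1], [0, 1]]                    # toy 2-state trellis: state = last input
    sent = [1, 0, 1]
    bm = [[[0.0 if u == b else 1.0 for u in range(2)] for _ in range(2)] for b in sent]
    print(viterbi(bm, nxt))
```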

The receiver also uses an array of 4 antenna elements, the signals from which are amplified,
down-converted, and digitised using IF and ADC modules. As in the transmitter, a single
board of 4 channellisers is used to perform channel filtering, and signals are then routed to a
DSP card where the STC receiver processing is performed. The DSP card is controlled by a
controller processor, and received data is fed to the controller for routing to the end user, etc.
As with the transmitter, common frequency references are used across each of the 4 receive
paths, and the sampling instants of the ADCs are also synchronised.

When STC is applied to a 3G CDMA system, the individual components of the
Space-Time-Symbols (STS) for the Forward-Link (FL) data channels are modulated
simultaneously on the multiple transmit antennas. This is carried out using identical outer
spreading and identical Walsh codes. It is in this way that the bandwidth efficiency benefits of
STC can be achieved through effective 'reuse' of Walsh codes at the different transmit
antennas. The receiver must place a rake despreader or 'finger' at the appropriate time offset
(corresponding to a particular time delay) in order to simultaneously 'capture' the energy
from all of the transmit antennas. However, in order to ensure that this simultaneous
despreading takes place, it must be ensured that not only is the same Walsh code used at
each transmit antenna, but also the relative delay between the different transmissions is
minimised, as they are received at each receiver antenna. Assuming that only a small
contribution of this relative delay comes from the transmit and receive antenna separation,
the major effect will be due to differential delays in the multiple transmitter chains, and in
particular in the SAW filters.

In order to carry out demodulation and decoding of the Space-Time Coded signal, the
receiver needs to know the complex channel weight between every transmitter-receiver
antenna pair, for every delayed time-tap. The 'brute-force' way to achieve this would be for
the transmitter to transmit multiple pilots, one from each antenna, where each of these pilots
uses up a separate Walsh code. The receiver then tracks each of these pilot transmissions
at each receiver antenna. An alternative, but essentially similar, technique is for the
transmitter to transmit multiple 'Auxiliary Pilots'. The principle of Auxiliary Pilots (AP) is to
use the same pilot Walsh code at each transmit antenna (e.g. the 'all 1s' Walsh code, W),
but to BPSK modulate it using an 'outer Walsh code'. In the simplest case, for two transmit
antennas, the sequence W,W,W,W,... would be transmitted from the 'main' transmit antenna,
and W,-W,W,-W,... from the 'diverse' transmit antenna. Thus, even though the pilots would
use the same Walsh codes, W, they would remain orthogonal over multiples of two Walsh
symbol periods. In this way, the same channel estimation performance can be maintained,
yet using the same Walsh codes at both transmit antennas. The only (minor) drawback of
this arrangement is that it limits the maximum rate of channel Doppler that can accurately be
tracked at the receiver, since it relies on this orthogonality property over two Walsh symbol
periods, rather than just one.
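
The separation of the two Auxiliary Pilots can be sketched as a sum and difference over two consecutive Walsh symbol periods. This is a minimal illustration assuming one receive antenna, a channel that is constant over the two periods, and pilot correlator outputs already normalised by the despreading gain; the function name is hypothetical.

```python
def estimate_from_auxiliary_pilots(p_first, p_second):
    """Recover the two channel gains from despread pilot outputs.

    p_first  = h_main + h_div   (both antennas send +W in the first period)
    p_second = h_main - h_div   (the 'diverse' antenna sends -W in the second)
    """
    h_main = (p_first + p_second) / 2
    h_div = (p_first - p_second) / 2
    return h_main, h_div

if __name__ == "__main__":
    h_main, h_div = 0.8 + 0.3j, -0.2 + 0.5j       # illustrative channel gains
    print(estimate_from_auxiliary_pilots(h_main + h_div, h_main - h_div))
```

If the channel changes appreciably between the two periods the sum and difference no longer cancel exactly, which is the Doppler limitation noted above.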
A further alternative is to use the technique described below which permits decoding and
channel estimation to be carried out by processing the same transmitted information (i.e.
non-training) symbols. This reduces the length of the training sequences which are required
and thereby provides increased system capacity.

In order to improve the performance of the space-time decoder, iteration between channel
estimation and decoding can be performed. In summary, the technique involves starting with
a poor quality channel estimate (due to some small amount of training), decoding the data,
and then using this decoded data as a 'noisy' training sequence for another round of
channel estimation and so on.

This iterative procedure improves upon the conventional system in that the decoded data is
used to improve the channel estimates, which in turn can be used to improve the quality of
the decoded data. This allows a reduction in the amount of training or pilot symbols which
are required, since the initial channel estimate does not have to be of the same quality that
would be required for the conventional receiver. This in turn improves the spectral efficiency
of the system.
Figure 12 shows a schematic representation of an exemplary STC communications system.

The system has a transmitter 100 and a receiver 102. The transmitter includes an STC
encoder 104 which takes data d and produces symbols x_1 and x_2 which are spread using
code W_2 and fed to two respective transmit antennas 106-1 and 106-2. (For simplicity, only
two transmit antennas and one receive antenna 108 are shown. The technique is equally
applicable to other numbers of antennas such as the (4,4) configuration described above.)
The transmitted symbols are despread in the receiver 102 and then processed (as described
in detail below) in a joint channel estimation/decoding process performed in channel
estimator 110 and decoder 112.

For illustrative purposes, it is assumed firstly that packets of L symbols are transmitted over
a quasi-static flat fading channel, i.e. H is constant for each packet duration but varies from
packet to packet, and secondly that M-PSK modulation is used, the symbols transmitted
from each antenna being in the modulation set $X = \{e^{2\pi j m/M},\ m = 1, 2, \ldots, M\}$.

The iterative steps performed by the channel estimator 110 and decoder 112 are illustrated
in the flowchart of Figure 13 and are described below with reference to that figure.
Step 200; Set the iteration number i = 0.

Step 202; Calculate the initial channel estimate $\hat{H}^{(0)}$. This may be estimated in
conventional ways, for example, based on a known training sequence fed to the transmit
antennas at the beginning of each frame, or based on pilot orthogonal Walsh codes. The
estimation may be done using any standard channel estimation algorithm, e.g. a least squares
estimator. A further alternative is to add known data into the input data stream which
causes the encoder to generate known channel symbols. This is similar to the training
sequence technique but has advantages in terms of reduced complexity.



It should be noted that this 'bootstrap' channel estimate does not have to be very good,
since it only has to be good enough to allow the iteration described below to get started.
Thus very few training/pilot symbols are required; certainly fewer than in an equivalently
specified conventional receiver.

Step 204; Decode data, preferably using a 'soft' algorithm such as the a-posteriori probability
(APP) algorithm and the current channel estimate, to produce a sequence of coded
symbol probabilities $\Pr^{(i)}(x_j[l] = x)$ for each transmit antenna j = 1,2,...,t and each
symbol interval l = 1,2,...,L. From these probabilities calculate the sequence of
coded symbol averages $\bar{x}_j[l] = \sum_{x \in X} x \Pr^{(i)}(x_j[l] = x)$ to produce
symbol estimates (step 206).
Optionally, 'harden' the probabilities by using a 'slicing' technique, for example, by setting
the probability of the most probable symbol to 1 and that of all other symbols to 0, and using
these to perform a 'hard' cancellation in step 214 below. Alternatively, the decoding could be
performed using a Viterbi decoder or other decoder which directly produces 'hard' outputs.
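
The calculation of the coded symbol averages in step 206, and the optional 'hardening', can be sketched as follows. The array layout (symbol interval, antenna, constellation index) and the function names are assumptions made for illustration; the probabilities themselves would come from the APP decoder, which is not shown.

```python
import numpy as np

def soft_symbol_averages(probs, constellation):
    """Average symbol per antenna and interval.

    probs: symbol probabilities, shape (L, t, M), each last axis summing to 1
    constellation: the M modulation symbols, shape (M,)
    Returns the averages, shape (L, t).
    """
    return probs @ constellation

def harden(probs):
    """'Slice' the probabilities: 1 for the most probable symbol, 0 elsewhere."""
    hard = np.zeros_like(probs)
    best = probs.argmax(axis=-1)
    np.put_along_axis(hard, best[..., None], 1.0, axis=-1)
    return hard

if __name__ == "__main__":
    M = 4
    X = np.exp(2j * np.pi * np.arange(1, M + 1) / M)   # M-PSK set
    p = np.array([[[0.1, 0.6, 0.2, 0.1]]])             # one interval, one antenna
    print(soft_symbol_averages(p, X), soft_symbol_averages(harden(p), X))
```

When the decoder has no information about a symbol (uniform probabilities over the PSK set) the average is zero, so that symbol contributes nothing to a subsequent soft cancellation.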



Step 208; If this is the last decoding step (because the channel estimate result has
'converged' to a substantially invariant estimate in the previous iteration, i.e. no further
improvement in the estimate is obtained by further iteration), the iteration loop is exited. The
decoder may now be arranged to output several different types of information depending
largely on the decoder requirements. For example, hard decisions on the coded symbols
may now be made (step 210) using the latest channel estimate, which provides the
information symbols. However, it will be noted that in step 204, the coded symbol
probabilities are calculated during channel estimation and these 'soft' outputs may (once the
iteration loop has been completed) be used to provide the inputs to another decoder in a
serially-concatenated (SISO) decoder such as that described below (in connection with
Figure 9). A third option is to use probabilities of the information bits (which will usually be
generated as a by-product of the APP decoding process of step 204) to make hard decisions
directly about the information bits.

Before convergence occurs, the channel is estimated in an iterative loop as follows.

Step 212; For j = 1 to t (loop over transmit antennas)

Step 214; Perform the soft cancellation
$\tilde{y}_j[l] = y[l] - \sum_{k \neq j} \bar{x}_k[l]\,\hat{h}_k^{(i)}$, where $y[l]$ is the
received signal in symbol interval l and $\hat{h}_k^{(i)}$ is row k of $\hat{H}^{(i)}$ (the gains
between antenna k and each of the receive antennas), where soft decoding
is performed in step 204, or otherwise perform hard cancellation. This step attempts to
remove the effect of transmit antennas other than antenna j. In the soft case, by cancelling
the average symbols (a soft cancellation) the effect of uncertain symbols on the estimate is
automatically reduced. In the extreme case in which the decoder has no idea what a
particular symbol is, the average for that symbol will be zero, and no cancellation will be
performed. However, simulations suggest that the reduced complexity of hard decoding
does not significantly reduce the channel estimation speed or quality.

Step 216; Remove the expected effect of the modulation on antenna j according to
$\hat{h}_j[l] = \mathrm{conj}(\bar{x}_j[l])\,\tilde{y}_j[l]$,
where conj denotes complex conjugate. Note that this produces one channel estimate per
symbol. Multiplication by the complex conjugate of the average symbol can be thought of as
'squaring out' the modulation. This requires a constant modulus modulation such as PSK to
be used, since in that case for any unit power modulation symbol a we have conj(a)a = 1.

The algorithm could be modified to deal with other modulation sets, but it would no longer be
as simple as multiplying by the conjugate of the average symbol (e.g. division by the
average symbol could be used although this would result in noise enhancement).

Step 218; Average the channel estimates
$\hat{h}_j^{(i+1)} = \left(\sum_{l=1}^{L} |\bar{x}_j[l]|^2\right)^{-1} \sum_{l=1}^{L} \hat{h}_j[l]$,
where the division by $\sum_{l=1}^{L} |\bar{x}_j[l]|^2$
scales the channel estimate according to the reliability of the decoder output. If the channel
changes during the packet a moving average filter may be used.

Step 220; End For Loop.

Step 222; Form a new channel estimate $\hat{H}^{(i+1)}$ using as row j the
$\hat{h}_j^{(i+1)}$ just calculated.

Step 224; i = i+1. Repeat steps 204 to 224 until convergence.
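
One pass through steps 212 to 222 can be sketched in NumPy as below. This is a minimal illustration under stated assumptions rather than the implementation of the embodiment: received samples are held in an array `y` of shape (L, r), soft symbol averages in `x_bar` of shape (L, t), the channel estimate in `H_hat` of shape (t, r), and the normalisation in step 218 is taken to be the sum of squared magnitudes of the soft symbols. All of these names and shapes are hypothetical.

```python
import numpy as np

def update_channel_estimate(y, x_bar, H_hat):
    """Return a refined channel estimate from decoded (soft or hard) symbols."""
    L, t = x_bar.shape
    H_new = np.zeros(H_hat.shape, dtype=complex)
    for j in range(t):                                   # step 212
        others = [k for k in range(t) if k != j]
        # Step 214: cancel the estimated contribution of the other antennas.
        y_tilde = y - x_bar[:, others] @ H_hat[others, :]
        # Step 216: remove the modulation, giving one estimate per symbol.
        h_per_symbol = np.conj(x_bar[:, [j]]) * y_tilde
        # Step 218: average, scaled by the reliability of the symbol estimates.
        weight = np.sum(np.abs(x_bar[:, j]) ** 2)
        H_new[j] = h_per_symbol.sum(axis=0) / weight if weight > 0 else H_hat[j]
    return H_new                                         # step 222

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    qpsk = np.exp(2j * np.pi * np.arange(1, 5) / 4)
    x = qpsk[rng.integers(0, 4, size=(200, 2))]          # perfectly decoded symbols
    H = rng.standard_normal((2, 2)) + 1j * rng.standard_normal((2, 2))
    y = x @ H                                            # noiseless reception
    H0 = H + 0.3                                         # deliberately poor estimate
    H1 = update_channel_estimate(y, x, H0)
    print(np.linalg.norm(H0 - H), np.linalg.norm(H1 - H))
```

In a full receiver this update would alternate with the decoding of step 204, with `x_bar` recomputed from the new estimate on each iteration.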

In this way, channel estimation is performed using information symbols which are also
decoded as part of the estimation process. Thus a very short training sequence (or the other
options discussed above in connection with step 202) may be used which is just long
enough to produce an adequate initial estimate in step 202. This is because the relatively
coarse initial 'training' is refined using what are conventionally considered to be 'non-training'
symbols. As discussed above, reduced training sequence lengths result in desirable
system capacity increases.


In effect, this technique releases information about channel gains that has always been 
present in information symbols (since the information symbols have been transmitted over 
the channels in the same way as training symbols) but which has hitherto been ignored. 

Of course, system capacity gained in this way would be of little value if the technique
reduced system performance significantly. Figure 14 shows a simulation result indicating
the relationship between pilot power and decoder performance for a conventional receiver.
The horizontal axis carries the ratio of the pilot power to the signal power in decibels. The
simulation parameters are t = 2, r = 1 and Eb/N0 = 14dB. The four-state code from V.
Tarokh, N. Seshadri and A. Calderbank, "Space-time codes for high data rate wireless
communication: Performance criterion and Code Construction," IEEE Trans. Inform. Theory
44(2):744-765 March 1998 is used. From this figure we can see that as the pilot power is
reduced, the performance of the conventional receiver degrades. In order to achieve
performance close to that with perfect channel knowledge, a pilot to signal power ratio
greater than -10 dB is required. From the point of view of training symbols, this means a
training overhead of at least 10%, which reduces spectral efficiency.
Figure 15 shows indicative performance results for the proposed system. The system
parameters are t = 2, r = 2. The same four-state code as that of Figure 14 is used. Only two
training symbols are used and APP decoding and soft-cancellation are implemented. The
horizontal axis shows the signal to noise ratio, Eb/N0. The vertical axis shows the measured
error rate (frame error rate for the top set of curves and bit error rate for the bottom set). The
solid lines show the performance with perfect channel knowledge. The dashed lines show
the error rate measured at the output of the iterative receiver for 1, 2, 3 and 4 iterations. The
FER performance of perfect channel knowledge is achieved after only two iterations.

Thus the proposed receiver reduces greatly the amount of training required compared to the
conventional receiver (for packets longer than 20 space-time symbols). The exact saving
depends on the fade rate, since this determines how often the training data must be sent.

Returning to the STC code, an alternative approach to obtaining a code with a large number 
of states (e.g. >256) is to serially concatenate a pair of low-complexity trellis codes. It is 
important to interpose the inner and outer coders with an interleaver of large span. For 
optimal performance a single MLSE with a large number of states is required, in order to 
decode the whole aggregate concatenated code-word in a single 'pass'. However, a lower- 
complexity alternative decoding strategy has been seen in many cases to offer close-to- 
optimum performance, at least for the case of serially-concatenated time-only (i.e. non-STC) 
codes. The basis of this latter technique, based on the 'turbo' principle, is to decode the two
constituent ('inner' and 'outer') codes individually using Soft Input Soft Output (SISO)
decoders. 

These SISO decoders should ideally be based on the APP (A Posteriori Probability)
algorithm, which itself is a development of the BCJR MAP algorithm (Bahl L.R., Cocke J.,
Jelinek F. and Raviv J.: 'Optimal Decoding of Linear Codes for Minimizing Symbol Error
Rate', IEEE Transactions on Information Theory, March 1974). The extrinsic soft outputs of
each constituent decoder provide intrinsic soft inputs to the next decoder (if the channel
estimation technique described above is used, soft outputs of the joint decoder/estimator are
used to feed into the second decoder). In this way it is hoped that the two decoders will
iterate towards the global overall MAP sequence detection (which should be close to MLSE).
The specification of a good overall Serially-Concatenated Space-Time Trellis Coder
(SC-STTC) structure, based on sound optimisation criteria, is complex. In Figure 9 a coding
and decoding structure is shown. One feature of the present invention, as discussed in
connection with EDGE, is that channel dispersion is treated as a 'decoding' problem
rather than an 'equalisation' problem, combined with an iterated serially-concatenated SISO
decoding scheme.

Figures 5(iv) and 9 show a top level serial concatenated STTC architecture. As described for
STC earlier, the transmitter employs Base Transceiver (BTS) modules suitably modified to
independently drive a 4-element antenna array. A controller unit supplies data for
transmission to a DSP card, upon which the STC coding and modulation for all 4 transmit
channels will be performed. Data is then passed to four channellisers of, for example,
1.26MHz bandwidth, and these signals are then up-converted to 2GHz for transmission.



The main candidate STC algorithms have been qualitatively described. The respective
performances in quantitative terms, based on simulation results, will now be discussed. The
simulations were carried out at the symbol level, for a simplified system, whereby frames of
space-time symbols were transmitted over a quasi-static flat Rayleigh fading MIMO channel.
Each frame carried 400 information bits (800 in the case of BLAST). Performance is
quantified in terms of required Eb/N0 at each receiver antenna for a mean frame-error-rate
(FER) of 10%, where Eb is the energy per information bit. There is no simulation of fast
power control, so received Eb is also equal to transmitted Eb (making the normalising
assumption that each channel coefficient has a mean square value of unity). For simplicity,
it is also assumed that there is zero correlation between the various transmit-receive
antenna paths, which assumes a certain minimum spacing between antenna elements at
both the BTS and mobile. The results of these simulations are shown in Table 2.

The results of Table 2 are also plotted in Figure 10, showing the trade-off of Eb/N0
performance (power efficiency) against spectral efficiency. The missing 'third dimension' on
the figure is that of decoder complexity. After conversion from Eb/N0 to Es/N0 (which is the
corresponding SNR metric) the performance results can also be compared with the
theoretical performance curves shown earlier. However, it should be remembered that the
underlying assumptions are different in the two cases, since, in the case of the fundamental
performance curves, the frames are assumed to approach infinite length, with a vanishingly
small error rate, and a Space-Time Code which adapts from frame-to-frame, whilst, for the
simulation case, there are finite frames, a fixed code, and an allowed non-zero probability of
frame error.
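
The Eb/N0 to Es/N0 conversion mentioned above can be sketched as follows. This is a minimal
illustration only, under the assumption that one space-time symbol occupies one unit of
bandwidth-time, so that the number of information bits per symbol equals the spectral
efficiency in bps/Hz. The function name is hypothetical; the example figures (-1.5dB at
4bps/Hz) are those quoted in this description for the MC-STTC-256 4:4 case.

```python
import math

def ebn0_to_esn0_db(ebn0_db, bits_per_symbol):
    """Convert Eb/N0 (dB) to Es/N0 (dB).

    Assumption: Es = bits_per_symbol * Eb, i.e. each space-time symbol
    carries bits_per_symbol information bits.
    """
    return ebn0_db + 10.0 * math.log10(bits_per_symbol)

# Eb/N0 of -1.5 dB at 4 information bits per symbol
esn0 = ebn0_to_esn0_db(-1.5, 4)
print(round(esn0, 2))
```

At 1bps/Hz the two metrics coincide; each doubling of spectral efficiency adds about 3dB to
Es/N0 for the same Eb/N0.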

Examining the results of Table 2 (and Figure 10) it can be seen that the benchmark has a
poor performance in the 1:1 case, with a required Eb/N0 of 11.0dB. This is 8.8dB higher than
the static channel performance, and is due to the high fade margin required due to the lack
of diversity. This performance can be improved either by increasing the number of receiver
antennas (yielding both SNR gain and diversity gain) or by increasing the number of transmit
antennas and using STTD, or both. In the latter case, for STTD 2:2, we see a required Eb/N0
of only 2.2dB.



For the case of two receiver antennas, this 2.2dB Eb/N0 is the best that can be achieved for
the algorithms simulated. However, it is achieved at the expense of a poor spectral efficiency
of only 1bps/Hz. The spectral efficiency can be doubled to 2bps/Hz using Tarokh's STTC-32
code, but at the cost of a 2.4dB degradation in mean Eb/N0. If the number of code states
is increased to 256 using the MC-STTC-256, such that the number of states is now equal to
that in the benchmark case, the 2:2 performance is improved to an Eb/N0 of 3.5dB. Thus,
compared to the STTD 2:2 case there is a degradation of only 1.3dB, but the spectral
efficiency is doubled. For a system which is predominantly bandwidth (Walsh code) limited,
rather than power (interference) limited, this 1.3dB Eb/N0 degradation would be a small
price to pay.

For the MC-STTC-256 case, for a receiver with four antennas, the power efficiency is further
improved, achieving an Eb/N0 of -1.5dB, with the spectral efficiency achieving 4bps/Hz.
Using this code, moving from an STTD 2:1 scheme to an MC-STTC-256 4:4 scheme results in an
8.7dB Eb/N0 reduction for a 4x increase in spectral efficiency. The comparison case of STTD
2:1 could be improved by using a Turbo code instead of a k=9 convolutional code. The 4:4
STC case could also be improved by a similar amount if the MC-STTC-256 code was
replaced by a serially-concatenated STTC code (SC-STTC). Thus the relative performance
benefit of STC compared to a 'transmit diversity' baseline would remain about the same.

The uncoded BLAST scheme shown in Table 2 performs poorly in terms of Eb/N0, but has a
high spectral efficiency due to the absence of redundancy, and the use of parallel
transmission paths. Separate frames of data (each carrying 400 information bits) are
transmitted from each of the two transmit antennas. The stronger of the two frames is
detected first, by spatially nulling out (zero forcing) the weaker transmission, before detecting
the bits. A 'genie-aided' subtraction process is then applied to perfectly subtract this signal at
the receiver, before beamforming to the weaker transmission (avoiding the requirement to
null the stronger, since it has already been subtracted). Finally the noisy QPSK data is
detected at the output of this beamformer, which is a noisy received signal from the weaker
transmit antenna.

Looking at the coded BLAST result in Table 2, by application of coding, it can be shown that
the performance of BLAST has been improved by 4.9dB, down from 12.0dB to 7.1dB. This
has been achieved by applying 'baseline' convolutional coding at the transmitter, along with
a transfer of soft metrics from the beamformer pre-processor to the respective decoder.
However this is achieved at the expense of a reduction in spectral efficiency. Non-genie-aided
simulation experience to date indicates that in practice the performance of coded
BLAST 2:2 is in fact some 2-3dB worse than this, and so the subtraction processing falls
rather short of a magic genie, due to the effect of error propagation down the layers. This
would suggest that the transmission from the stronger transmit antenna (for any given frame)
is actually more vulnerable (and therefore has a higher frame error probability) than the
transmission from the weaker, due to the 'noise amplification' in the receiver antenna nulling
process.
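
The detection order described above for BLAST 2:2 (null the weaker transmission, detect the
stronger, subtract it, then beamform to the weaker) can be sketched for a single received
space-time sample as follows. This is an illustrative sketch only, not the simulator used to
obtain Table 2. It assumes a known flat 2x2 channel, unit-energy QPSK, and non-genie
subtraction (the detected symbol, rather than the true symbol, is subtracted). All function
and variable names are hypothetical.

```python
def qpsk_slice(z):
    """Hard decision to the nearest unit-energy QPSK point."""
    re = 1.0 if z.real >= 0 else -1.0
    im = 1.0 if z.imag >= 0 else -1.0
    return complex(re, im) / 2 ** 0.5

def detect_blast_2x2(y, h_a, h_b):
    """Detect the symbols sent from transmit antennas A and B.

    y        : the two received samples (one per receive antenna)
    h_a, h_b : channel coefficients from antenna A / B to the two receivers
    Returns (s_a, s_b). Assumes h_a and h_b are not collinear.
    """
    p_a = abs(h_a[0]) ** 2 + abs(h_a[1]) ** 2
    p_b = abs(h_b[0]) ** 2 + abs(h_b[1]) ** 2
    a_is_strong = p_a >= p_b
    strong, weak = (h_a, h_b) if a_is_strong else (h_b, h_a)
    p_weak = p_b if a_is_strong else p_a

    # 1. Zero forcing: weights chosen so that w0*weak0 + w1*weak1 = 0
    w = (weak[1], -weak[0])
    z = w[0] * y[0] + w[1] * y[1]
    gain = w[0] * strong[0] + w[1] * strong[1]
    s_strong = qpsk_slice(z / gain)

    # 2. Subtract the detected stronger transmission from the received samples
    r = (y[0] - strong[0] * s_strong, y[1] - strong[1] * s_strong)

    # 3. Beamform (maximal ratio combine) to the weaker transmission
    z2 = weak[0].conjugate() * r[0] + weak[1].conjugate() * r[1]
    s_weak = qpsk_slice(z2 / p_weak)

    return (s_strong, s_weak) if a_is_strong else (s_weak, s_strong)

# Noise-free usage example with an arbitrary channel
h_a = (1 + 0.5j, 0.3 - 0.2j)
h_b = (0.2 + 0.1j, -0.4 + 0.6j)
s_a = complex(1, -1) / 2 ** 0.5
s_b = complex(-1, 1) / 2 ** 0.5
y = (h_a[0] * s_a + h_b[0] * s_b, h_a[1] * s_a + h_b[1] * s_b)
out_a, out_b = detect_blast_2x2(y, h_a, h_b)
```

The division by `gain` in step 1 is where noise amplification arises: when the two channel
vectors are nearly parallel, `gain` is small and any noise in `z` is scaled up before the
decision on the stronger transmission.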



Table 3 shows a comparison of the complexities of the different STC algorithms in terms of
MFLOPS (or MIPS if a fixed-point device is used). These processing loads assume a
153.6kbps data rate over the link. A 1-tap channel is assumed, and so the complexity
analysis is pertinent to both TDMA and CDMA implementations in this flat channel. A
number of simplifying assumptions have been made.

The processing tasks are partitioned into 'demodulator' and 'Viterbi MLSE'. The
demodulator generates the Log-Likelihood Ratios (LLR), also known as 'soft metrics', for the
Viterbi decoder. The Viterbi algorithm then performs a maximum-likelihood trellis search
based on these LLRs in order to find the highest probability path through the trellis (the
so-called 'survivor path'), and hence the most likely information sequence.
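
The trellis search described above can be illustrated with a small soft-input Viterbi
decoder. This is a sketch under stated assumptions rather than the decoder analysed in
Table 3: a 4-state, 1/2-rate convolutional code with generators (7,5) octal stands in for
the codes discussed in this description, and the LLR sign convention (positive means bit 0
is more likely) is chosen here for illustration. All names are hypothetical.

```python
K = 3                      # constraint length of the stand-in code
G = (0b111, 0b101)         # generator polynomials (7,5) octal
N_STATES = 1 << (K - 1)

def parity(x):
    return bin(x).count("1") & 1

def branch(state, bit):
    """Return (next_state, coded_bits) for one input bit."""
    reg = (bit << (K - 1)) | state          # newest bit in the top position
    return reg >> 1, tuple(parity(reg & g) for g in G)

def encode(bits):
    state, out = 0, []
    for b in bits:
        state, coded = branch(state, b)
        out.extend(coded)
    return out

def viterbi(llrs):
    """Maximum-likelihood search; llrs holds one soft metric per coded bit."""
    neg = float("-inf")
    metric = [0.0] + [neg] * (N_STATES - 1)   # encoder starts in state 0
    paths = [[] for _ in range(N_STATES)]
    for i in range(0, len(llrs), len(G)):
        new_metric = [neg] * N_STATES
        new_paths = [None] * N_STATES
        for s in range(N_STATES):
            if metric[s] == neg:
                continue
            for b in (0, 1):
                ns, coded = branch(s, b)
                # Branch metric: add the LLR if the hypothesised bit is 0, else subtract it
                m = metric[s] + sum(l if c == 0 else -l
                                    for l, c in zip(llrs[i:i + len(G)], coded))
                if m > new_metric[ns]:        # keep only the survivor into each state
                    new_metric[ns] = m
                    new_paths[ns] = paths[s] + [b]
        metric, paths = new_metric, new_paths
    best = max(range(N_STATES), key=lambda s: metric[s])
    return paths[best]

# Usage: encode, map to ideal soft metrics, corrupt one metric, decode
message = [1, 0, 1, 1, 0, 0, 1, 0, 0]
llrs = [1.0 if c == 0 else -1.0 for c in encode(message)]
llrs[3] = -llrs[3]
decoded = viterbi(llrs)
```

The work per trellis step is proportional to the number of states, which is why the Viterbi
processing load grows with the number of code states.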



Examining the figures in Table 3, it is interesting to note that for the benchmark cases only a
small percentage of the processing load is taken up by the demodulator and Maximal Ratio
Combining (MRC), and the vast majority is in the Viterbi decoding (198.4 MFLOPS). Since
the STTD 'de-rotation' preprocessing is similar in principle to MRC, this also doesn't add
much to the overall processing. For all of these cases the Viterbi processing load is the
same.

For STTC-32 the Viterbi processing load is much lighter than for the benchmark case due to 
the reduced number of states in the trellis. However, the amount of demodulator processing 
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is increased due to the larger number of hypotheses regarding possible transmitted space- 
time symbols. This is a natural consequence of the higher spectral efficiency of our code. 

The Viterbi processing load for the MC-STTC-256 code is much greater than for STTC-32, in
direct proportion to the increase in the number of states. As the number of transmit antennas
is increased to 4 there is a significant increase in the demodulator processing load, and the
demodulator load becomes of a similar order to the Viterbi processing load. For 8 transmit
antennas there is an extremely large processing load of around 71 GFLOPS, a figure which
would almost certainly rule out the use of such a code with so many transmit antennas for
the foreseeable future. The reason for this increased demodulator processing is the huge
increase in the number of possible transmitted STS, and hence the large number of
hypotheses of possible received STS.
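
The two scaling effects described above can be sketched numerically. The sketch assumes,
purely for illustration, that each transmit antenna sends one QPSK symbol per space-time
symbol, so that the number of distinct transmitted STS is 4 raised to the number of transmit
antennas; it does not attempt to reproduce the MFLOPS figures of Table 3. The function names
are hypothetical.

```python
def num_sts_hypotheses(n_tx, constellation_size=4):
    """Number of distinct space-time symbols the demodulator must hypothesise.

    Assumption: an independent constellation point (QPSK by default)
    is sent from each of the n_tx transmit antennas.
    """
    return constellation_size ** n_tx

def viterbi_load_ratio(states_new, states_ref):
    """Relative Viterbi load, taken as proportional to the number of states."""
    return states_new / states_ref

for n_tx in (2, 4, 8):
    print(n_tx, num_sts_hypotheses(n_tx))

# MC-STTC-256 relative to STTC-32
print(viterbi_load_ratio(256, 32))
```

Under this assumption the hypothesis count grows exponentially with the number of transmit
antennas, whereas the Viterbi load depends only on the number of trellis states.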

For coded BLAST the Viterbi processing load doesn't change with the number of transmit
and receive antennas since the fundamental information rate remains the same. The
demodulator processing load, which is shown broken down into 'weight estimation',
'beamforming', and 'subtraction' processing, is in all cases significantly less than the Viterbi
MLSE processing, even for the case of 8 transmit antennas. This indicates that for a system
where it is wished to employ such a large number of antennas at each end of the link then,
for a given spectral efficiency, coded BLAST is far preferable to STTC in terms of
complexity, albeit that it will suffer a performance loss, for the type of receiver processing
described herein.

An STC scheme which is not analysed in Table 3 is SC-STTC. However, experience with
binary concatenated codes, such as Turbo codes, would indicate that the decoder
complexity would certainly be well within an order of magnitude of the 'equivalent' non-Turbo
(e.g. convolutional) code. The demodulator complexity (i.e. generation of soft metrics) for
SC-STTC should therefore be of the same order as for the equivalent STTC.



The OSTBC techniques offer power efficiency benefits through diversity, although not
through coding gain. Thus, there will be an application in situations where power efficiency is
at a premium, and inherent diversity is low, such as those cases where there is a flat fading
channel and only a single terminal antenna. Conversely, due to their low spectral efficiency,
they are not of interest in applications where there is already significant diversity and SNR
gain through array antennas at the terminal.

Of the Space-Time Coding approaches actually simulated, the STTC techniques appear to
offer the best trade-off of performance versus complexity for the applications of interest. The
performance they offer goes some way to fulfilling the expectations raised by the
fundamental capacity analyses earlier, since there is an improvement in Eb/N0 of around
8.7dB compared to an STTD 2:1 arrangement. Optimal STTC codes are known for the 2:2
case, but not for 4:4. However, a simply-derived STTC code based on the 1/2-rate
convolutional encoder, namely the MC-STTC-256, is expected to provide close-to-optimum
performance for this class of coding at this complexity (probably within 1dB).

The layered (BLAST) codes, as implemented in the simulator, offer poor performance. This
is largely due to the loss of diversity, which arises due to the antenna nulling process, and
error-propagation effects due to imperfect subtraction. It is believed that coded BLAST
schemes will always prove to be inferior to STTC due to this diversity loss for the
non-optimal decoders described here, and because the decoders are inefficient, since they do
not 'share' information in order to carry out joint estimation. However, for those applications
which might be operating with large numbers of antennas (8 or more) at each end of the link,
BLAST becomes more attractive than STTC, due to the significantly lower demodulation and
decoding processing requirement (albeit with poorer performance). Thus STTC (or its
serially-concatenated variants) is most appropriate for small numbers of transmit antennas
(up to 4, say), but when there are larger numbers of transmit antennas then it could
be that BLAST is more appropriate.

Based on experience with conventional time-only convolutional codes, it is to be expected
that even low-complexity serially-concatenated codes (SC-STTC) would outperform
non-concatenated STTC codes of even quite high complexity. The benefit of
a concatenated structure relative to a non-concatenated structure, for a given decoder
complexity, would, perhaps, be something of the order of 1-2dB.

With respect to the implementation aspects of the invention, the following notes are relevant:

SAW filters: There is a requirement within STC for very tight control of delay through the
multiple transmit chains. The main source of any such delay is believed to lie within the
transmit SAW filters, since they are required to have steep roll-off skirts within their
frequency response. SAW filters for these applications typically have a mean group delay
(averaged over the pass-band) of the order of microseconds, for example. This mean group
delay is much larger than the typical chip period for wideband CDMA applications. SAW
filters also exhibit a considerable ripple in their frequency and phase (and hence group
delay) response across their pass-band due to triple-transit effects. However, it is believed
that the wide-band nature of the CDMA waveform will mean that these delays are averaged
out in the time domain, and that it is the mean delay which is of most concern, in particular
how this can be matched across devices from the same, and possibly different,
manufactured batches. SAW filter delay problems can be minimised by an appropriate
choice of SAW filters which are well matched in terms of mean delay. An alternative could be
to carry out some delay compensation in the transmitter DSP.

Local Oscillator (LO) lock: In order to maintain orthogonality between the different Walsh
codes over a symbol period, and ensure that a single despreader at the receiver can be
used to receive multiple transmissions from the different transmit antennas, it is important
that the carrier frequencies of the different transmissions are closely controlled. This
effectively means that the phase noise must be controlled, and that the LO offset between
different transmit chains be much less than the reciprocal of the symbol period. Preferably
different transmit chains have LOs locked to a common reference.
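
The LO offset requirement above can be checked with a short calculation of the relative
phase rotation accumulated between two transmit chains over one symbol period. The sketch
below is illustrative only; the chip rate of 1.2288Mcps and the 100Hz offset are assumed
example values that do not appear in this description, and the function names are
hypothetical.

```python
def symbol_period_s(chips_per_symbol, chip_rate_hz):
    """Duration of one Walsh symbol in seconds."""
    return chips_per_symbol / chip_rate_hz

def phase_drift_deg(lo_offset_hz, period_s):
    """Relative phase rotation between two transmit chains over one symbol."""
    return 360.0 * lo_offset_hz * period_s

# Assumed example: 64-chip Walsh symbol at an assumed chip rate of 1.2288 Mcps
t_sym = symbol_period_s(64, 1.2288e6)
limit_hz = 1.0 / t_sym                  # the offset must be much less than this
drift = phase_drift_deg(100.0, t_sym)   # drift for an assumed 100 Hz offset
print(limit_hz, drift)
```

With these assumed values the reciprocal of the symbol period is 19.2kHz, and a 100Hz offset
rotates the relative phase by under 2 degrees per symbol, which satisfies the 'much less
than' condition.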

Amplitude imbalance: The Space-Time codes are designed assuming equal mean power
from each of the transmit antennas. If this cannot be maintained exactly, then it will probably
not be critical, as long as any gain imbalances suffered by an STC encoded traffic channel
are also suffered by the pilot. Thus, the Space-Time decoder can take these imbalances into
account when calculating the soft metrics, and will in fact not be able to distinguish transmit
gain imbalances from propagation gain imbalances. However, there will be a reduction in
the fundamental capacity of the MIMO transmitter-receiver link.


Pilot (and Auxiliary Pilot) de-spreading: There is a requirement for the receiver to track (i.e.
de-spread) all of the pilot transmissions from all of the transmit antennas, at each of its
receive antennas, for N_L paths. If the system has N_T transmit antennas, N_R receiver
antennas, and N_L time-dispersed taps in the channel, then it can be seen that the DSP
processing load to carry out this pilot de-spreading scales in proportion to N_T·N_R·N_L, if a
different pilot Walsh code is employed on each transmit antenna. However, considerable
DSP complexity savings are possible if the Auxiliary Pilot (AP) principle is used, since the
same fundamental Walsh sequence is then used at each transmit antenna.
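
The scaling of the pilot de-spreading load can be sketched as a count of correlators. This
is a minimal illustration assuming one correlator per transmit antenna, receive antenna and
channel tap when a different pilot Walsh code is used on each transmit antenna; the function
name and the example antenna and tap counts are hypothetical.

```python
def pilot_correlators(n_t, n_r, n_l):
    """Number of pilot correlations per symbol, proportional to N_T * N_R * N_L."""
    return n_t * n_r * n_l

# Example: 2:2 and 4:4 links, each with an assumed 3 time-dispersed taps
load_2x2 = pilot_correlators(2, 2, 3)
load_4x4 = pilot_correlators(4, 4, 3)
print(load_2x2, load_4x4, load_4x4 / load_2x2)
```

Doubling the number of antennas at both ends of the link quadruples the pilot de-spreading
load for the same number of taps.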

Traffic channel de-spreading: As mentioned above, in order to carry out de-spreading of a
traffic channel, the receiver must place a rake 'finger' at the appropriate time offset
(corresponding to a particular time delay) in order to simultaneously 'capture' the energy
from all of the transmit antennas. Again, this effectively involves a 'correlation' or 'matched
filtering' process over the period of a Walsh code ('Walsh symbol'), which, depending on the
symbol rate chosen, may be somewhere typically in the range 64-256 chips.
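
The correlation over one Walsh symbol can be sketched as follows. The example is a
simplified illustration, assuming chip-aligned, noise-free reception of two channels spread
with different 64-chip Walsh codes, and a Sylvester-type Hadamard construction for the code
set; scrambling, multipath and the space-time decoding that follows are omitted. All names
are hypothetical.

```python
def walsh_matrix(n):
    """Rows are length-n Walsh codes (+1/-1); n must be a power of two."""
    rows = [[1]]
    while len(rows) < n:
        rows = [r + r for r in rows] + [r + [-x for x in r] for r in rows]
    return rows

def spread(symbol, code):
    """Spread one complex symbol over the chips of a Walsh code."""
    return [symbol * c for c in code]

def despread(chips, code):
    """Correlate received chips with a Walsh code over one Walsh symbol."""
    return sum(x * c for x, c in zip(chips, code)) / len(code)

codes = walsh_matrix(64)
sym_1, sym_2 = complex(1, 1), complex(-1, 0)

# Two channels on different Walsh codes, summed chip by chip at the receiver
rx = [a + b for a, b in zip(spread(sym_1, codes[3]), spread(sym_2, codes[5]))]

out_1 = despread(rx, codes[3])
out_2 = despread(rx, codes[5])
out_unused = despread(rx, codes[7])
```

Because the Walsh codes are mutually orthogonal over the full symbol, each correlator
recovers its own symbol and an unused code yields zero, provided the orthogonality is not
disturbed by timing or frequency errors.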

Searcher-finger workload: Whilst carrying out demodulation, the terminal receiver must also
be able to search for new time-dispersed multipaths. Applying a brute-force approach, this
would involve processing which scales by N_T·N_R.

DLL jitter: Delay-Locked Loop (DLL) techniques will be used at the receiver to track the
timing of the various multipaths. In order to be able to track time-delayed multipaths with
time delays changing at a rate commensurate with the channel Doppler, the loop-bandwidth
of the DLL will need to be reasonably wide.
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